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METHODS AND APPARATUS FOR IMPROVING VOICE QUALITY 



IN AN ENVIRONMENT WITH NOISE 



RELATED APPLICATIONS 
[0001] This application claims the benefit of U.S. Provisional Application No. 60/457,945, filed on 
March 27, 2003, entitled "METHODS AND APPARATUS FOR IMPROVING VOICE QUALITY IN A 
NOISY ENVIRONMENT", under 35 U.S.C. 1 19(e). 

FIELD OF THE INVENTION 
[0002] The present invention relates to speech signal management. More specifically, the present 
invention relates to improving the voice quality of a speech signal received by a listener in an 
environment with noise. 

BACKGROUND OF THE INVENTION 
[0003] The present invention can improve the voice quality of a speech signal received or heard by a 
person listening on a phone in an environment with noise. Particularly in the case where a listener is 
using a wireless phone, a listener may have difficulty listening to speech signals the listener receives 
because the listener is in an environment with noise, such a city environment where there is a lot of street 
noise. A natural response for a listener being in an environment with noise is to raise his or her voice in 
level and pitch. This response has been called the Lombard Effect. An existing approach to solving this 
problem of listening on a phone in an environment with noise is that the listener raises the volume of the 
ear piece of the phone. Existing approaches to solving the problem require the listener make adjustments 
to his or her speech and manual adjustments to the phone in order to help make the speech signal received 
by the listener dominate over the noise in the listener's environment or help make the listener feel as 
though he or she is returning a coherent speech signal amidst the environment with noise. An 
improvement to existing approaches does not require the listener to make such adjustments. 
[0004] The present invention borrows the ideas of the Lombard Effect and a listener raising the 
volume of the ear piece to improve the voice quality of a speech signal received by a listener on a phone 
in an environment with noise. Accordingly, the present invention determines when the listener is in an 
enviromnent with noise and based on the level of noise in the listener's environment, processes the speech 
signal received by the listener. As a result of this processing, the speech signal received by the listener 
can dominate over the noise in the listener's environment. Thus, the goal of the present invention is to 
make the speech signal received by a listener easier to hear and understand. 
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SUMMARY OF THE INVENTION 
[0005] According to a first exemplary embodiment, a method for improving a downlink signal 
received by a listener on a phone is disclosed. The method includes calculating an environment noise 
level of the listener and filtering and adjusting gain of the downlink signal based on the environment 
noise level. 

[0006] According to a second exemplary embodiment, a method for improving a downlink signal 
received by a listener on a phone is disclosed. The method includes calculating an environment noise 
level of the listener, delaying the downlink signal if the environment noise level is less than a first 
threshold and filtering and adjusting gain of the downlink signal if the environment noise level is higher 
than a second threshold. 

[0007] An apparatus for improving a downlink signal received by a listener on a phone is disclosed. 
The apparatus includes a noise level calculator that calculates an environment noise level of the listener, a 
filter that filters the downlink signal and a gain controller that adjusts gain of the filtered downlink signal 
based on the environment noise level. 

DESCRIPTION OF THE DRAWINGS 
[0008] The present invention is illustrated by way of example and not by way of limitation in the 
figures of the accompanying drawings, in which like references indicate similar elements and in which: 
FIG. 1 is a block diagram of a voice quality improvement system, according to an exemplary embodiment 
of the present invention; 

FIG. 2 is a flow diagram illustrating a method for managing a speech signal, according to an exemplary 
embodiment of the present invention; and 

FIG. 3 is a graph illustrating the scaled frequency response of a filter employed by an exemplary 
embodiment of the present invention, 

DETAILED DESCRIPTION 
[0009] By processing speech signals in the downlink direction of a phone network, the present 
invention can improve the intelligibility of the speech signal received by a listener on a phone. The 
present invention determines when a listener on a phone is in a high ambient noise environment and 
makes adjustments to the level and frequency response of a downlmk speech signal received by the 
listener to improve the listener's intelligibility of the speech signal. An exemplary embodiment of the 
present invention samples the noise level in the uplink signal firom a listener on a phone (picked up for 
example by the microphone on a phone) and processes the speaker's downlink signal to the listener on the 
phone. 
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[0010] An exemplary embodiment processes a speaker's downlink signal by filtering and adding 
gain to the downlink signal. Filtering includes adding emphasis to any high frequencies of the downlink 
signal; this can allow the speech to stand out from environmental noise (that is often predominantly low 
frequency noise). Filtering also includes removing any low frequency content of the downlink signal; this 
can allow the embodiment to add more gain without clipping. In filtering, what is considered a high 
frequency and a low frequency is predetermined and dependent upon a particular application. For 
exanq)Ie, the filter of the exemplary embodiment considers frequencies between 0-900 Hz to be low and 
frequencies between 900-4000 Hz to be high. The amount of filtering and gain that is applied is 
predetermined and dependent upon the level of environmental noise. For example, if the level of 
environmental noise is low, gain and filtering are not applied. What is considered low or high 
environmental noise is predetermined and dependent upon a particular application. 
[0011] FIG. 1 is a block diagram of a voice quality improvement system, according to an exemplary 
embodiment of the present invention. A speaker's downlink signal is transmitted to voice quality 
improvement system 100 at line 105. The signal is then transmitted at line 125 so that it can be processed 
by delay line 135 and gain controller 145 and output from gain controller 145 as a first processed signal at 
line 150. The speaker's downlink signal is also transmitted at line 130 so that it can be processed by filter 
140 and gain controller 145 and output from gain controller 145 as a second processed signal at line 155. 
Regarding filter 140 and delay line 135, an exemplary embodiment employs an FIR filter with 15 FIR 
filter taps for filter 140 and a 7 frame delay line for delay line 135 and an alternative embodiment 
employs an DR filter and a 0 frame delay line for delay line 135. Adder 160 combines the first processed 
signal and the second processed signal. The voice quality improvement system 100 transmits the signal 
through line 110. 

[0012] A listener's uplink signal is transmitted to voice quality improvement system 100 at line 115. 
The signal is then transmitted to noise level calculator 165 at line 170. Noise level calculator 165 can use 
the signal to calculate the noise level of the environment in which the listener's phone is being used. The 
noise level calculation is transmitted to gain controller 145 at line 175. Gain controller 145 can use the 
noise level calculation to adjust the amount of gain applied to the speaker's downlink signal or control the 
amount of processing performed on the signal based upon the noise level calculation. The listener's 
uplink signal is transmitted from voice quality improvement system 100 at line 120. 
[0013] FIG. 2 is a flow diagram illustrating a method for managing a speech signal, according to an 
exen^lary embodiment of the present invention. At 200, the noise level of the environment in which 
listener's phone is being used is calculated. An exemplary embodiment calculates a slow moving or long 
time average of the noise level of the listener's uplink signal so that any changes in processing a voice 
quality improvement system are gradual. This embodiment can employ time constants that slow down 
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the noise level average, for example a rise time constant of 200 ms and a fall time constant of 0.6 ms. An 
alternative embodiment employs a signal level averaging technique. For example, if the listener's uplink 
signal is larger than the current noise level average, the current noise level average is slightly increased. 
If the listener's uplink signal is smaller than the current noise level average, the average is greatly 
reduced. As a result of using a signal level averaging technique, the noise level average can gradually 
increase in response to background noise and can drop rapidly in response to absence of background 
noise. 

[0014] At 205, the noise level is compared with a low noise threshold. The low noise threshold is a 
predetermined value, dependent upon a particular application. For example, -50 dBm could be a 
predetermined low noise threshold. If the noise level is less than the low noise threshold, next is 210. 
Otherwise, next is 215. At 210, the speaker's downlink signal is routed to and processed by a delay line 
and a gain controller. An exemplary embodiment could use delay line 135 and gain controller 145 shown 
in FIG. 1. At 215, the noise level is compared with a high noise threshold. The high noise threshold is a 
predetermined value, dependent upon a particular application. For exanq)le, -25 dBm could be a 
predetermined high noise threshold. If the noise level is higher than the high noise threshold, next is 220. 
Otherwise next is 230. At 220, the speaker's downlink signal is routed to and processed by a filter and 
gain controller. An exemplary embodiment could use filter 140 and gain controller 145 shown in FIG. 1. 
At 230, predetermined values of gain are applied to the speaker's downlink signal or a predetermined 
amount of processing is performed on the signal depending upon the noise level. The predetermined gain 
values or the processing amounts can be adjusted in response to a particular application. For example, 
predetermined gain values could be applied as follows: 

If noise level is less than -45 dBm and greater than or equal to -50 dBm, gain applied is -3 dB. 

If noise level is less than -40 dBm and greater than or equal to -45 dBm, gain applied is -6 dB. 

If noise level is less than -35 dBm and greater than or equal to -40 dBm, gain applied is -12 dB. 

If noise level is less than -30 dBm and greater than or equal to -35 dBm, gain applied is -18 dB. 

If noise level is less than -25 dBm and greater than or equal to -30 dBm, gain applied is -24 dB. 
As a result of executing 230, the speaker's downlink signal is partially routed to and processed by a delay 
line and a gain controller (using for example delay line 135 and gain controller 145 shown in FIG. 1) to 
generate a first processed signal and partially routed to and processed by a filter and gain controller (using 
for example filter 140 and gain controller 145 shown in FIG. 1) to generate a second processed signal. In 
executing 230, the delay line helps to synchronize the first and second processed signals. 
[0016] FIG. 3 illustrates the scaled frequency response of an exemplary embodiment filter such as 
for example filter 140 shown in FIG. 1. FIG. 3 shows that the filter is a high pass filter that adds 
emphasis or high frequency energy to the higher frequencies in the speaker's downlink signal to inq)rove 
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intelligibility of the signal. To add emphasis, the filter contains a gain stage so that the high pass filter 
produces a gain in the pass band. The gain stage is a predetermined gain at a predetermined fi-equency or 
range of frequencies and dependent upon a particular application. For example, FIG. 3 illustrates a gain 
stage that amplifies the signal by 9 dB at frequencies between 1600-4000 Hz. The filter also subtracts 
low frequency energy from the speaker's downlink signal. The amount of low fi-equency energy 
subtracted is a predetermined amount at a predetermined firequency or range of fi'equencies and dependent 
upon a particular application. For example, FIG. 3 illustrates a subtraction of 10 dB from the signal at a 
frequency of about 500 Hz. The coefficients of the exemplary embodiment filter before scaling are as 
follows in floating point format: 

H( 1)= -0.31150279E-03= H( 15) 

H( 2) = 0.32880172E-01 = H( 14) 

H( 3)= 0.54618171E-01 = H( 13) 

H( 4)= 0.26821463E-01 = H( 12) 

H( 5)= -0.50330563E-01 = H( 11) 

H( 6)= -0.12736719E+00= H( 10) 

H( 7)= -0.165I1263E+00= H( 9) 

H( 8) = 0.72750000E+00 = H( 8). ; 
[0017] An exemplary embodiment implements voice quality improvement system 100 with 
microcode or software for a digital signal processor or ASIC. 

[0018] In the foregoing description, the invention is described with reference to specific example 
embodiments thereof. It will, however, be evident that various modifications and changes may be made 
thereto, without departing ft-om the broader spirit and scope of the present invention. For example, some 
of the steps illustrated in the flow diagram may be performed in an order other than that which is 
described. It should be appreciated that not all of the steps illustrated in the flow diagrams are required to 
be performed, that additional steps may be added, and that some of the steps may be substituted with 
other steps. Also, embodiments of the present invention may be provided as a computer program product, 
or software, that may include a machine-readable medium having stored thereon instructions. Further, a 
machine-readable medium may be used to program a conq)uter system or other electronic device and the 
readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and 
magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, flash memory, or 
other type of media/machine-readable medium suitable for storing electronic instructions. The 
specification and drawings are accordingly to be regarded in an illustrative rather than in a restrictive 
sense. 
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